Playing a quantum game on polarization vortices 
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The quantum mechanical approach to the well known prisoners dilemma, one of the basic ex- 
amples to illustrate the concepts of Game Theory, is implemented with a classical optical resource, 
nonquantum entanglement between spin and orbital degrees of freedom of laser modes. The con- 
cept of entanglement is crucial in the quantum version of the game, which brings novel features 
with a richer universe of strategies. As we show, this richness can be achieved in a quite unexpected 
context, namely that of paraxial spin-orbit modes in classical optics. 
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Numerous quantum information protocols rely on en- 
tanglement, a property usually attributed to composite 
quantum systems that cannot be described by a tensor 
product state vector. Quantum cryptography and tele- 
portation are popular examples of protocols relying on 
quantum entanglement. While this property is consider- 
ably sensitive to local measurements performed in each 
party of the composite system, it is unaffected by local 
unitary operations. Teleportation protocols, for exam- 
ple, are achieved by classical communication followed by 
local unitary operations and measurements. This frame- 
work is where Quantum Mechanics meets an important 
area of applied Mathematics, the Game theory, a pow- 
erful tool for decision making [l|, Here, two or more 
agents (players) take their decisions by acting on a quan- 
tum system with unitary operations. These decisions or 
conflict situations can be as simple as tossing a coin[l| 
or rather involved like the so-called minority game [3|, 
where one models a competition among several players 
for a limited resource. In this sense games can be co- 
operative or non-cooperative like the prisoners dilemma, 
and with complete (incomplete) information where one 
player knows (does not know) all strategies his opponent 
can choose. Quantum versions of this game was realized 
experimentally both with Nuclear Magnetic Resonance 
[^] and entangled photon pairs [5]. 

Although frequently attributed to quantum systems, 
entanglement has been recently identified in classical op- 
tics as the coherent superposition of paraxial modes with 
orthogonal spatial profiles and orthogonal polarizations. 
We shall refer to such superpositions as spin- orbit modes. 
These modes were used in our group as a classical opti- 
cal resource to investigate the topological phase acquired 
by an entangled state following a cyclic evolution under 
local unitary operations [6] and to demonstrate align- 
ment free BB84 quantum cryptography ref.[7]. Also, a 
spin-orbit Bell inequality has been investigated both in 
the quantum ^] and classical [9] domains. An important 
tool for spin-orbit coupling was used in ref.[lo| to ex- 
change quantum information between these two degrees 
of freedom. In a recent work [11] the term nonquantum 



entanglement has been coined to designate the spin-orbit 
inseparability of paraxial modes in connection with the 
Mueller matrices employed in polarization optics. In this 
work we demonstrate how this kind of nonquantum en- 
tanglement can be used to evaluate the performance of 
quantum strategies in a classical example of game theory, 
the prisoners dilemma. 

A laser beam propagating along the z direction is usu- 
ally described by its polarization unit vector e and its spa- 
tial mode '^(r). The spatial modes in rectangular coordi- 
nates are Hermite-Gaussian (HG) solutions of theparax- 
ial wave equation described in many textbooks [12|. The 
subspace of first order spatial modes has a qubit struc- 
ture similar to the polarization mode space, where HG 
modes along different orientations play the role of linear 
polarizations and the Laguerre-Gaussian (LG) modes are 
equivalent to circular polarization [13]. By combining the 
two mode spaces, a general spin-orbit mode can be writ- 
ten as 



*(r) = ailJh{r)eH ^ f^i^v{T^)eH 



(1) 



where eH{v) ^ire linear polarization unit vectors along the 
horizontal (vertical) directions, and ip^y^{r) are HG spa- 
tial modes along these same directions. In analogy to the 
usual entanglement measure used for bipartite quantum 
states [14], we can define the spin-orbit mode concur- 
rence, which for eq.([T]) is C = \aS — I3j\. We shall refer 
to the spin-orbit modes with maximal concurrence C = 1 
as maximally entangled modes. In these modes neither 
the polarization nor the spatial profile is well defined, in 
fact they correspond to polarization vortices which have 
been extensively studied due to their potential applica- 
tions to high resolution microscopy [TBI, [l6| . In this work 
we demonstrate another appealing feature in the unex- 
pected context of a quantum game. An example of such 
entangled spin-orbit mode is 
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We use this mode to implement a classical optical ver- 
sion of the well known prisoners dilemma, in which two 
players, Alice and Bob, accused of a felony, have to decide 
whether they cooperate (C) or defeat (D) each other. In 
classical game theory, each agent decision is represented 
by one bit of information with possible states C and D. 
Depending on their decision, a reduced penalty may be 
applied to each one of them. The penalty reduction is 
the payoff each player gets from their combined decisions. 
The penalty reductions for both players are shown in ta- 
ble H] for all possible strategies adopted by Alice (rows) 
and Bob (columns). From the table we see that both 
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TABLE I: Penalty reduction table 

players are tempted to choose although their added 
reduction would be maximized by CC. Here comes the 
dilemma, the players are isolated without the permission 
to negotiate. Each player is left to his own and has to 
decide whether to defeat or cooperate with the other, 
a bad choice may cost his freedom. For a cooperative 
game, players would choose strategies which maximize 
both payoffs, i.e., they would search for Pareto optimal 
strategies; on the other hand, since prisoners dilemma 
is a non-cooperative game, each player will try to max- 
imize solely his own payoff, i.e., the intelligent choice 
is the Nash equilibrium DD. At this point, the con- 
cepts of Game Theory are due. Suppose this situation is 
repeated many times and the players adopt probabilis- 
tic strategies, that is, they randomly choose between C 
and D with prestablished probabilities. Then, the payoff 
function of each player is given by the average penalty 
reduction obtained: 



p {m^n) Rj{m^n) 

m^n—C^D 



(3) 



where j = A^B and p (m, n) = PA{'^)pB{n) is the joint 
probability that Alice chooses m and Bob chooses n. In 
the classical approach to the problem, one shows that the 
payoff as a function of pa(^) and Pb(^) has an absolute 
minimum at Pa{D) = Pb{D) = 1, that is the best the 
players can do is to defeat, as a consequence of the severe 
cost brought by a possible betrayment. 

In ref.flT] Eisert proposes an ingenuous alternative to 
the classical approach, employing the concept of quan- 
tum entanglement. Briefly, in this approach the pris- 
oners share a pair of entangled qubits and rather than 
making a definite C or D statement, they are allowed 
to perform single qubit unitary operations (strategies), 
each one on the qubit in his possession. The entangled 



two-qubit state is prepared by a nonlocal operation 
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(4) 



acting on an initial state so that U\CC) = 

{\CC) + i\DD)) /\/2 . After each player has applied his 
own strategy Uj {j = A, 5), the qubits are nonlo- 
cally operated with W and separately measured. The 
payoff function (|3|) is evaluated with the probabilities 
p(m,n) = {mn\U^UA^UB)U\CC) (m,n = C,D) as- 
sociated with the two possible outcomes {C or D) in 
each qubit. Therefore, the strategy space is much larger 
in this quantum approach, it corresponds to the space 
of SU{2) ^ SU{2) matrices, apart from irrelevant global 
phases. This quantum version has been implemented ex- 
perimentally with quantum correlated photon pairs gen- 
erated by parametric down conversion [ISj . 
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FIG. 1: Experimental setup. 

We now demonstrate the implementation of the quan- 
tum game strategies with the nonquantum entanglement 
provided by the spin-orbit classical mode (|2]). In the 
game language, we shall make the identification H = C 
and V = D. The experimental setup is shown in fig.([T]). 
The TEMoo output of a He-Ne laser is diffracted by 
a hologram that generates an HG mode on the first 
diffraction order, so that the initial spin-orbit mode is 
'0/i(r) e-H' This mode is first sent to a quarter wave plate 
(QWP) rotated at 45^, which makes the transformation 
(e// + ^ ey) /a/2, and then to a Mach-Zehnder 
(MZ) interferometer with input and output polarizing 
beam splitters (PBS). A Dove prism (DP) is inserted in 
one arm of the MZ interferometer, and the relative phase 
(j) between the two arms is controlled by a piezoelectric 
transducer (PZT). The DP oriented at 45^ makes the 
transformation tph V^v, so that this MZ interferome- 
ter coherently superposes modes '?/^^(r)ey and il)h{^)^H 
at its output with an adjustable phase (j). In the basis 
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{ijh (r) ch , ^Pv (r) en , V^/^ (r) ey , (r) ey } , the matrix rep- 
resentation of the MZ transformation is: 
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(5) 



SO that after passing through the QWP and the balanced 
interferometer {(j) = 0), the beam is prepared in mode 
([2]). This mode is the object on which the players will 
implement their strategies. 

In order to continue our experimental description, it 
is essential to define the mode converter operators which 
will be used to implement the players strategies and W . 
We will be dealing with either polarization (wave plates) 
or spatial (DP and cylindrical lenses) mode converters, 
that is, elements acting on one degree of freedom only. 
We shall inform rotation angles in degrees and phase re- 
tardations in radians for immediate identification of the 
physical meaning all over the experimental description. 
When oriented along horizontal and vertical directions, 
they introduce a retardation phase (j) between H and V 
modes. A mode converter rotated by the angle 6 is de- 
scribed by the SU (2) matrix: 



COS 2 + ^ sin ^ cos 20 
i sin ^ sin 20 



i sin ^ sin 20 
f - zsinf cos2l9 



(6) 

For example, quarter wave plates correspond to ^ = 7r/2 
and half wave plates (HWP) to <p = tt. Spatial mode 
converters can be made with cylindrical lenses [19] for 
variable retardation ^, or the DP for cj) = ir. Now, Al- 
ice is equipped with polarization elements and realizes 
strategies of the kind C{Oa^ 0a), whereas Bob is equipped 
with DPs and cylindrical lenses, and his strategies are 
C{Ob,(I>b)' After the players have made their choices, 
the spin-orbit mode of the laser beam is: 

^i(r) = [C{OA,^A)^C{OB,^B)]^o{r) . (7) 

The mode converters are also used together with 
MZ{(j)) to implement . Indeed, one can easily show 
that: 

= MZ(0) [C(-45^ 7r/2) ^ /] MZ{7t/2) , (8) 

which physically means that, after passing through the 
players strategies, the beam is sent through a MZ inter- 
ferometer with a 7r/2 phase shift, a quarter wave plate 
(QWP) rotated at —45^, and another MZ interferometer 
with balanced arms (0 = 0), where it gets transformed 
to mode 



*2(r) = f/^*i(r) 



171,71 



Cmn V^m (l*) ^r, 



(9) 



After this transformation, we arrive at the measure- 
ment stage where the probabilities p(m, n) used in the 
payoff function eq.([3]) are obtained. In our setup, these 
probabilities are given by the projected intensities: 



p(m,n) |c„ 



/ 



dFv C(r) [e:-*2(r)] 



(10) 



which, in fact, correspond to photodetection probabilities 
when the beam is attenuated down to the single photon 
regime. In order to measure the projected intensities, 
the ^2(1*) mode is first sent to a PBS where polarization 
projection is performed. Then, each PBS output is sent 
to a balanced Mach-Zehnder interferometer with an ad- 
ditional mirror in one arm (MZIM), where spatial mode 
projection occurs fioj. The projected intensities are mea- 
sured at the four outputs of the two MZIMs either with 
a CCD camera or with photodetectors. The payoff func- 
tion is evaluated with the intensities measured with four 
photodetectors. The background noise is subtracted and 
the partial intensities are then normalized to the total 
intensity so that p(m, n) = Im,n/ ^tot- 
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FIG. 2: a) Coefficients table of the final mode in the compu- 
tational basis, b) Images of the corresponding output ports. 



where m = h^v and n = H^V. 



For each player, we implemented five different strate- 
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gies: iX = C(45^7r), Qi = C(45^7r/2), / = C(l9,0), 
Q2 = C(0,7r/2), and iZ = C(0,7r). In this notation X 
and Z are the usual Pauh matrices. Strategies / and 
iX are equivalent to the classical ones where the players 
can only cooperate or defeat, while the other strategies 
are intrinsically quantum mechanical since they involve 
rotations and phase retardations not available in the clas- 
sical scenario. A table with the coefficients Cmn resulting 
from these strategies is shown in fig.(|2^). Also, the im- 
ages obtained with the CCD camera are displayed in the 
same table format in fig.(|2l3). As expected, only those 
output ports corresponding to nonzero coefficients are 
illuminated. In this sense, the qualitative agreement be- 
tween the two tables is clear. We have also evaluated 
Alice's payoff as a function of the strategy parameters 
(Oa^^^a) and {9b-,4>b) in the domain = 0, < (/> < tt) 
and {9 = 45^,0 < (/> < tt) . The analytical result is 
shown in fig. ([3]) together with the points corresponding 
to all possible combinations of the experimental strate- 
gies. The experimental values were obtained from the in- 
tensity measurements, where the relative intensities play 
the role of measurement probabilities. We observe, from 
both theoretical and experimental results, that the quan- 
tum move Ua = Ub = iZ proposed originally in [I^ 
dominates all classical ones and it is also a Nash equilib- 
rium with a better outcome than Ua = Ub = iX since 
no player can improve his respective payoff by chang- 
ing unilaterally his strategy. Note also that if Bob could 
choose only between classical Ub = I or Ub = iX^ and 
Alice keep Ua = iZ then it would be better for him to 
cooperate since his payoff would be increased. 




FIG. 3: Alice payoff as a function of the strategies parameters 
(^a,0a) and {Ob,cI>b)- Dots correspond to the experimental 
values obtained with the intensity measurements. 

In conclusion, we have used the concept of nonquan- 
tum entanglement to implement a Game Theory task in 
the context of the well known prisoners dilemma. The 
advantages offered by the quantum mechanical approach 



could be realized in the classical optics framework. Non- 
separable spin-orbit modes corresponding to polarization 
vortices were used. This implementation opens promis- 
ing perspectives regarding potential applications of non- 
quantum entanglement to the investigation of quantum 
information protocols. 
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